Evaluating Multi-lingual Information Retrieval and Clustering at ULIS

نویسندگان

  • Atsushi Fujii
  • Tetsuya Ishikawa
چکیده

This paper describes our retrieval system for NTCIR-2 Japanese/English CLIR and MLIR tasks. We integrate query and document translation with monolingual retrieval to improve retrieval accuracy, and perform clustering to improve browsing efficiency. We also introduce an entropy-driven technique in evaluating clustering methods.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Evaluationg Multi-lingual Information Retrieval and Clustering at ULIS

This paper describes our retrieval system for NTCIR-2 Japanese/English CLIR and MLIR tasks. We integrate query and document translation with monolingual retrieval to improve retrieval accuracy, and perform clustering to improve browsing efficiency. We also introduce an entropy-driven technique in evaluating clustering methods.

متن کامل

NTCIR-3 Patent Retrieval Experiments at ULIS

Given the growing number of patents filed in multiple countries, users are interested in retrieving patents across languages. We propose a multi-lingual patent retrieval system, which translates a user query into the target language, searches a multilingual database for patents relevant to the query, and improves the browsing efficiency by way of machine translation and clustering. Our system a...

متن کامل

Approaching the Problem of Multi-lingual Information Retrieval and Visualization in Greek and Latin and Old Norse Texts

In this paper, we explore approaches to multi-lingual information retrieval for Greek, Latin, and Old Norse texts. We also describe an information retrieval tool that allows users to formulate Greek, Latin, or Old Norse queries in English and display the results in an innovative clustering and visualization facility.

متن کامل

Public Transport Ontology for Passenger Information Retrieval

Passenger information aims at improving the user-friendliness of public transport systems while influencing passenger route choices to satisfy transit user’s travel requirements. The integration of transit information from multiple agencies is a major challenge in implementation of multi-modal passenger information systems. The problem of information sharing is further compounded by the multi-l...

متن کامل

Learning a Cross-Lingual Semantic Representation of Relations Expressed in Text

Learning cross-lingual semantic representations of relations from textual data is useful for tasks like cross-lingual information retrieval and question answering. So far, research has been mainly focused on cross-lingual entity linking, which is confined to linking between phrases in a text document and their corresponding entities in a knowledge base but cannot link to relations. In this pape...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001